A segmental nearest neighbor normalization and gene identification method gives superior results for DNA-array analysis.
Abstract
An intuitive normalization and gene identification method is proposed. After segmentation of the entire expression range into intensity intervals, the mean and standard deviation of the logarithm of expression ratios are calculated for each interval using the nearest neighbor genes. Genes with high differential expression are excluded from these calculations. For glass arrays, normalization is performed for each interval by using the mean of the logarithm of expression ratios in the interval. For nylon/plastic membranes, the average of the means of the logarithm of ratios across the intervals of higher intensities is used for normalization. Compared with other normalization methods, this method delivered the smallest normalization errors for 42 nylon/plastic arrays used to analyze cultured T cells and 22 Clostridium acetobutylicum glass arrays. For identifying differentially expressed genes, upper and lower boundaries are constructed for each interval by using the standard deviation of the expression ratio logarithms. When a C. acetobutylicum pSOL1 megaplasmid-deficient strain M5 was used, this method identified more "down-regulated" pSOL1 genes with fewer misidentifications in a comparative array analysis of M5 versus the parent strain. A comparison of quantitative RT-PCR results with different gene identification methods indicates that the proposed method is superior to other methods.
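The per-interval procedure described for glass arrays can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how intervals are chosen, how nearest neighbor genes are selected, or what cutoffs are used. The function name `segmental_normalize`, the equal-size segments ranked by mean log intensity, the fixed log2 exclusion cutoff `exclude_log2`, and the boundary multiplier `z` are all assumptions made for illustration.

```python
import math
import statistics


def segmental_normalize(ch1, ch2, n_segments=4, exclude_log2=1.0, z=2.0):
    """Return (normalized_log_ratios, flags) for paired channel intensities.

    Assumptions (not taken from the source): equal-size segments by rank of
    mean log intensity, a fixed log2 cutoff for excluding strongly
    differential genes, and boundaries at mean +/- z * SD.
    """
    n = len(ch1)
    log_ratio = [math.log2(a / b) for a, b in zip(ch1, ch2)]
    log_int = [0.5 * math.log2(a * b) for a, b in zip(ch1, ch2)]
    # Rank genes by intensity so each segment holds genes of similar intensity.
    order = sorted(range(n), key=lambda i: log_int[i])

    normalized = [0.0] * n
    flags = [False] * n
    size = math.ceil(n / n_segments)
    for start in range(0, n, size):
        seg = order[start:start + size]
        # median_low returns an actual data point, so `kept` is never empty.
        center = statistics.median_low(log_ratio[i] for i in seg)
        # Exclude genes with high differential expression from the statistics.
        kept = [log_ratio[i] for i in seg
                if abs(log_ratio[i] - center) <= exclude_log2]
        mean = statistics.fmean(kept)
        sd = statistics.stdev(kept) if len(kept) > 1 else 0.0
        for i in seg:
            # Normalize with the segment mean, then test against the
            # upper and lower boundaries built from the segment SD.
            normalized[i] = log_ratio[i] - mean
            flags[i] = abs(normalized[i]) > z * sd
    return normalized, flags


# Toy data: a roughly twofold channel bias, plus one strongly reduced gene (index 5).
ch2 = [100, 200, 300, 400, 1000, 2000, 3000, 4000]
ch1 = [200, 420, 570, 800, 2000, 200, 6300, 7600]
normalized, flags = segmental_normalize(ch1, ch2, n_segments=2)
```

In this toy input the twofold bias is removed within each segment, and only the gene at index 5 falls outside the boundaries. The nylon/plastic variant, which averages the segment means across the higher-intensity intervals, is not covered by this sketch.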
Journal:
- Proceedings of the National Academy of Sciences of the United States of America
Volume 100, Issue 3
Pages -
Publication date: 2003